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RECENT TRENDS IN ACfflEVEMENT 



Introduction^ 

In recent years positive trends have been noticed in the achievement test scores of 
American elementary and secondary school students, indicating a reversal of the general decline 
which occurred in the 1960s and 1970s. The score declines of those years were evident 
nationwide at all ability levels, in many subject areas, and on various kinds of tests— most notably 
on the SAT (Scholastic Aptitude Test) (Breland, 1976) and ACT (American College Testing 
Program) (Munday, 1976). Similar patterns of decline appeared on the MSAT (Minnesota 
Scholastic Aptitude Test) (Harnischfeger & Wiley, 1975) and the Iowa tests (Iowa Testing 
Programs, 1984, 1987); further confirmation of declining achievement was found in the results 
of the National Assessment of Educational Progress (NAEP) (Harnischfeger & Wiley, 1975). 
Score declines were mor^t severe among older students (Harnischfeger & Wiley, 1975) and in 
areas requiring inferential and critical thinking skills (Congressional Budget Office, 1986). 

The more recent positive trends have been somewhat less uniform, with scores increasing 
for various age groups, student ability levels, and subject areas at different points in time. In 
general, however, the scores of younger children began to increase around 1975, while older 
groups did not show similar improvements until the early 1980s (The College Board, 1986, 1988; 
Congressional Budget Office, 1986; Iowa Testing Programs, 1987; National Assessment cf 
Educational Progress [NAEP], 1986), giving the initial impression that gains were occurring 
only at the lower grade levels. 

Recent studies using data from NAEP reports have raised concerns that higher 
achievement test scores may primarily n ;t gains for relatively low-achieving students in 
low-level skills (Dossey, Mullis, lindquist, & Chambers, 1988; Forbes, 1985; Mullis & Jenkins, 
1988; NAEP, 1986). In other words, students of higher ability hav3 not matched the 
achievement gains of lower-ability students, and for all students there remains a critical 
deficiency of higher-order thinking skills (HOTS). These concerns are based primarily on the 

1 Material presented in this paper is based on the second author's master's thesis: Han, M. Y. (1988). The Nature of 
Recent Trends in the Achievement Scores of Elementary and Secondary Students in Iowa. Unpuolished master's 
thesis, The University of Iowa, Iowa City, lA. 
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tendency (within age group) of lower NAEP achievement classes to show greater improvement 
than higher achievement classes (Forbes, 1985) and on a lack of significant increase in the 
percentage of students reaching the highest proficiency levels in mathematics and science 
(Dossey et al., 1988; Mullis & Jenkins, 1988). Additional concerns about students' higher-order 
thinking abilities are raised in connection with the more qualitative writing assessment 
(Applebee, Langer, & Mullis, 1986)* 

Not all tests, however, have matched the trends apparent in the .^aEP data. A 
comparison of trends on the SAT, the ACT, and other standardized tests at the high school 
level revealed many discrepancies between tests: score changes over the 1970-1983 interval 
range from -.26 to .10 standard deviation units (Congressional Budget Office, 1986). Such 
variation suggests that no single test can provide a definitive portrayal of overall achievement 
trends. 

Objective 

In light of the concerns raised by the NAEP data, the primary objective of this study was to 
examine recent trends in achievement test scores among elementary and secondary students in 
Iowa schools. In particular, three questions v/ere addressed: 

1. Are the recent improvements in test scores greater for the lower grades than they 
are for the upper grades? 

2. Is there a difference in the magnitude or direction of score changes for students 
of different ability levels? 

3. Is there a difference in the pattern of score changes for items differing in 
complexity (lower-level versus higher-order thinking skills)? 

Data Sources 

The tests 

Form 7 of the Iowa Tests of Basic Skills (ITBS) was used to measure the achievement of 
students at Grades 3 and 7, and Form X-7 of the Iowa Tests of Educational Development 
(ITED) was employed for Grade 11. These grade levels were chosen because they correspond 
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most closely to the age groups for which the NAEP results have been reported 

The annual statewide use of the ITBS and ITED in Iowa produced data particularly useful 
for examining score changes, as the demographic composition of the test-taking population is 
extremely stable and the vast majority of Iowa students are assessed. Children at all grade levels 
are tested over a comprehensive range of subject areas. Moreover, trends in achievement 
among Iowa students have been similar to those occurring nationally (Congressional Budget 
Office, 1986). 

The Iowa Tests of Basic Skills . The ITBS battery is admim^stered in about 99% of the 
public and private schools in Iowa; the statewide testing program currently involves betweeii 
35,000 and 40,000 students in each of grades 3 through 8 The tests are intended to provide 
comprehensive and periodic measurement in fundamental academic skills. The concept of 
fundamental or "basic" skills is not limited to low-level tasks, but covers a variety of 
abilities— including higher-order cognitive competencies— which are important for a child's 
success in academic activities and related life skills. The major test areas are: 

Vocabulary (V) 

Reading Comprehension (R) 

Language 

-Spelling (LI) 

-Capitalization (L2) 

-Punctuation (L3) 

-Usage (L4) 
Work Study Skais 

-Visual Materials (Wl) 

-Reference Materials (W2) 
Mathematics 

-Concepts (Ml) 

-Problem Solving (M2) 

-Computation (M3) 
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The ITBS are designed to reflect the continuous nature of skill development. The tests 
are arranged in levels constructed to reveal progress along a continuous developmental path 
from kindergarten to Grade 9 (Hieronymus, Lindquist, & Hoover, 1982). The ITBS yield grade 
equivalent (GE) scores and within-grade percentile ranks for each of three times of year (fall, 
mid-year, and spring). 

The Iowa Tests of Educational Development . In use since 1942, the ITED are designed 
to measure essential abilities that practically all adolescents and adults must use in daily life, 
including higher-order competencies. The tests emphasize cognitive skills. They are not closely 
tied to any curriculum and attempt to measure processes rather than specific content 
knowledge. Scores reflect intellectual development which occurs as a result of out-of-school 
experiences as well the effects of school learning (Iowa Testing Programs, 1979). Results are 
reported in terms of standard scores and percentile ranks. 

The ITED assess achievements of high school students (Grades 9-12) in these areas: 

English 

- Correctness and Appropriateness of Expression (E) 
-Spelling (E) 

Quantitative - Quantitative Thinking (Q) 
Social Studies 

- Concepts and Backgrounds (SS) 
-Readmg Material (SS) 

Natural Sciences 

- Concepts and Backgrounds (NS) 
-Reading Material (NS) 

Literature - Interpretation of literacy Material (L) 
Vocabulary (V) 

Use of Sources of Information (SI) 

These tests measure competency in a vanety of important skills: recognizing the essentials of 
good writing, solving quantitative problems, critical analysis of discussions of social issues, 
understanding nontechnical scientific reports and recognizing sound methods of scientific 
inquiry, perceiving subtle meanings in literary materials, and m24king effective use of sources of 
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information and common tools of learning (Iowa Testing Programs, 1979). 
The subjects 

ITBS scores of Iowa third and seventh graders in the 1978-79 school year were compared 
with those of the same grades in 1984-85; ITED scores of eleventh graders from 1979-80 and 
1985-86 were likewise compared. Tv/o parallel test forms were administered in a 
counterbalanced fashion to the same age populations over these six-year periods, with specific 
forms alternated from year to year. Thus students at each grade level were tested with identical 
forms at each of the two assessments, permitting direct comparisons of perfonnance. 

Only those Iowa schools which administered Form 7 of the ITBS or Form X-7 of the 
ITED to all regularly enrolled students in Grades 3 and 7 (ITBS) or Grade 11 (ITED) at the 
same time of year during both assessments were included in the study. Students in these grades 
are approximately the same age as the children tested by NAEP: 9, 13, and 17. At each grade 
level in each of the selected school years, data from over 5000 students satisfying the preceding 
criteria were available. Actual numbers of students are listed by grade level and year in Tables 
1-3. ^ 

Methods and Results 

Question 1: Are the improvements in scores greater for the lower grades than for the upper 
grades? 

To answer this question, mean scores were computed at each of the three grade levels for 
each of the two assessments (1978-79/1979-80 and 1984-85/1985-86). Differences bet>\'een the 
later and earlier means were calculated. Because the data at the different grade levels were 
based on tests of differing length, these mean differences were then divided by the standard 
deviations of the earlier groups. Effect sizes are thus expressed in standard deviation units. In 
addition, ratios of standard deviations (1984-85 s.d. divided by 1978-79 s.d. for Grades 3 ana 7; 
1985-86 s.d. divided by 1979-80 s.d. for Grade 11) were computed for the purpose of examining 
changes in variability. 
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Significant positive differences (p < .001) were found between mean achievement scores 
for the two assessments in all subject areas at all grade levels. Effect sizes are similar across 
subject areas and across grade levels. Older students showed as much improvement as younger 
students. Variability declined slightly (with minor exceptions) in the second assessment. Mean 
differences and standard deviation ratios are presented by grade level and test area in Tables 
1-3; this information is averaged and summarized in Table 4. 

Question 2: Is there a difference in score changes for students of different (high versus low) 
ability levels? 

Data from Table 4 showing an increase in means and decrease in variability suggest that 
low-scoring students have shown more improvement than high -scoring students. This 
conclusion is substantiated by a comparison (between 1978-79/1979-80 and 1984-85/1985-86 
data) of the percentages of students at each of three performance levels: 1) a perfect score, 2) 
90% + correct responses* and 3) 30%- correct responses. 

In Grades 3 and 7 the proportion of students earning perfect scores increased slightly over 
the six-year interval. In Grade 11 the percentage of students obtaining perfect scores on the 
ITED was so small at both test dates that no reasonable comparison could be made. In general, 
decreases in the percentage of students with a "percent correct" score of 30 or less were greater 
in magnitude than the corresponding increases in percentage of students with "percent correct" 
scores of 90 or more. That is, the proportion of students that "moved up" the score scale is 
greater for the low-scoring group. These data are presented by grade level and test area in 
Tables 5-7; they are averaged and summarized ^'n Table 8. "Percentage change" figures should 
be interpreted with ca .ion, since the rates of change depend on differing initial base 
percentages. 



ERIC 



8 



RECENT TRENDS IN ACHffiVEMENT 



PAGE? 



Question 3; Is there a difference in score changes for items of different complexity (lower-level 
versus higher-level)? 

All items on Form 7 of the ITBS and Form X-7 of the ITED were classified as HOTS or 
non-HOTS items by two independent judges using the skills classifications and guidelines 
published by the Iowa Testing Programs (1979, 1984, 1988). Determination of the skill 
complexity required by a test item is by nature a subjective process; the procedures followed in 
classifying items :ire described in greater detail in the appendix. 

Fcr each subtest the average difficulty levels (p-values) were computed for HOTS and for 
non-HOTS items. Differences between later (1984-85/1985-86) and earlier (1978-79/1979-80) 
average p-values were calculated. These values are reported by subject area for each grade 
level in Tables 9-11. Nearly all differences are positive, evidencing improved performance on 
both higher-and lower-level skill items for all subtests. (The one exception is for the Reading 
test at Grade 7, where a slight decline occurred for lower-level items only.) The magnitude of 
change on HOTS items is similar to that on non-HOTS items. Indeed, aside from those test 
areas which are composed ^;ntirely of lower-level items, the increases are in general slightly 
greater for HOTS items. Summary data reporting p-values averaged across test areas are given 
in Table 12. 
Summary 

A major limitation of this study should be mentioned. There is a possibility that some 
teachers may have explicitly taught specific test items. The same test forms were administered 
every other year, and comparisons were made between the first and fourth administrations of 
the same form. However, it seems unlikely that this actually occurred. The tests were not 
available to the teachers throughout the year; test materials are returned to the Iowa Testing 
Programs following each administration. More importantly, school level accountability is not 
emphasized in the testing of Iowa students. Most of the testing is done in the fall, and test use 
focuses on diagnosing individual needs for the purpose of improving instruction. Explicitly 
"teaching to the test" would serve little purpose and would not be expected to occur on a grand 
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enough scale to markedly affect statewide means. In addition, if instraction were focused on 
specific items or skills, it seem most likely that this focus would be directed toward the 
lower-level items, which are more easily taught. Gains in lower-level skills would thus be much 
greater than those actually found. 

Overall, the results of this study are encouraging. In Iowa, during the period of time 
investigated, achievement test scores improved as much for junior high and high school students 
as they did for the younger children. The gap between the performance of high-scoring and 
low-scoring students was somewhat reduced; this reduction appears to have taken place without 
a decline in the overall performance of high-scoring students and was not at the expense of 
achievement on higher-order, critical thinking skills. 

Discussion 

The past two decades have seen a variety of changes in America's approach to education, 
including the 'Tjack to basics" movement of the 1970s and the more recent retum to an emphasis 
on traditional academic goali>. A possible effect of these movements can be seen in the recent 
upward trends in achievement scores of elementary and secondary school students. This study 
helps to clarify the nature of those trends. Results are largely congruent with the improvements 
noted by the NAEP for students of similar ages. However, the findings of this study differ from 
the NAEP reports (Appleb' e et al., 1988; Dossey et al., 1988; MuUis & Jenkins, 1988; NAEP, 
1986) in two encouraging ways: 

1, Although the increase in scores is greater for low-scoring students, high-scoring 
students in Iowa also showed substantial improvement between 1978-79 (1979-80) 
and 1984-85 (1985-86) at each grade level studiec^ 

2. This study found no practical differences in the changes in performance for 
higher- level versus lower-level skills; Iowa students' performance increased on 
all types of items. 
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Obviously, given the sample of Iowa students, we cannot be certain of the extent to which 
the results of this study hold true for students across the nation. Iowa students, on the average, 
tend to score higher than the national average on standardized achievement tests. This may 
partially account for the difference between our findings and the NAEP data. If a relatively 
small percentage of Iowa students are "low achievers," perhaps less school time is required for 
remediation and concentration on low-level skills, leaving room for greater emphasis on 
higher-order skills and providing more opportunities for high-scoring students* 

Further explanation of the difference from the NAEP findings might be found in the 
general lack of unifomrty of achievement score trends. Even the NAEP assessments show no 
difference in improvement in lower-order versus higher-order thinking skills for some groups 
on some tests. In reading, for example, 9-year-olds exhibited similar patterns of score changes 
at different levels of skill complexity in the 1970s (NAEP, 1981); at all ages over the period 
1980-1984, score changes in literal comprehension differed only slightly (not significantly) from 
those in inferential comprehension (NAEP, 1986). 

A difference in this study's approach to data analysis may also be relevant. We have 
conceptualized higher-order processes in terms of what tbe item requires of the test taker. The 
NAEP data are summarized in terms of proficiency level; for most tests, items are not explicitly 
divided into higher-order versus lower-order classifications. Items of varying complexity may 
appear at all levels of proficiency. In the NAEP reading tests, although the difficulty of 
passages increases at higher proficiency levels, attendant item sets include both literal and 
inferential comprehension questions at all levels (NAEP, 1986). An examination of sample 
i'lems in math and science indicates that higher proficiency levels are marked by advanced 
content as well as advanced reasoning skills, and some items reflect simple reasoning or lecall 
(non-HOTS) of advanced material (see for example Dossey et al., 1988, p. 41). 
High-proficiency items in science, especially, seem to rely on advanced content as well as 
higher-order processing for their difficulty. Possible improvements in higher-order thinking 
skills may thus be obscured as HOTS and non-HOTS items are interwoven with proficiency 
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levels. It would be interesting to see what results the NAEP data would yield if trends in 
higher-order thinking were examined at the itvm level. 

The NAEP assessments clearly indicate ths*, as a group, American students have not 
attained the academic proficiency levels we would like; achievement goals which appear to be 
reasonable have not yet been met, partiailarly in math and science. However, it is not clear that 
this academic shortfall reflects a lack of complex thinking abilities. The results of this study 
indicate that complex cognitive skills have not fallen by the wayside in Iowa schools. Although 
Iowa students cannot be said to constitute a nationally representative sample, the pattern of 
improvements they have shown implies that the renewed emphasis on hasi^ skills and traditional 
academic goals nee d not be detrimental to high-achieving students nor require a neglect of 
higher-order thinking skills. 
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DEHNITION AND CLASSIFICATION OF SKILL LEVEL 

Some test items are easily classified as requiring higher-order thmkiiig skills (HOTS) or as 
demanding less complex mental processing (non-HOTS). Most educators would agree, for 
example, that an item which asks a student simply to identify an error of capitalization or 
punctuation would fall in the non-HOTS category. Many test items, however, are more difficult 
to categorize. These items must be examined individually and subjectively, and experts may 
disagree in then: classifications. Whether or not they concur on classification, experts share the 
opinion that some test items may require higher-order thinking functions of some examinees 
but not of others (Iowa Testing Programs, 1988). For example, one test taker may be able to 
answer a question correctly by recalling and executing a simple memorized formula while 
another examinee, who has not leamed the formula, must actively reason through the problem. 
Any HOTS/non-HOTS classification scheme must therefore be regarded as somewhat 
arbitrary. The following paragraphs describe the criteria by which ITBS and ITED items were 
categorized as HOTS or non-HOTS for the purposes of this study. 

The Iowa Tests of Basic Skills (ITBS). For the ITBS, classification procedures are 
described in a supplemental guide (Iowa Testing Programs, 1988). ITBS items included in the 
HOTS category require the exercise of relatively complex cognitive functions. Simple recall of 
information is insufficient; the student must actively use information in an operative sense to 
achieve a goal. The general conception of higher-order thinking encompasses the cognitive 
operations of interpretation, inference, classification, analysis, and comparison. 

All items in the ITBS Vocabulary test (V) are classified as non-HOTS. 

For the Reading test (R), items based on recognition of literal meaning or comprehension 
of stated facts or relationships are regarded as non-HOTS. Those that require the student to 
use inductive reasoning, or to draw conclusions or make predictions on the basis of information 
given or implied in the passage are designated as HOTS. The content of distractors was also 
taken into account in the classification process; if two or more response choices can easily be 
ruled out on the basis of information presented literally in the passage, the item is labeled 
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non-HOTS* Presmnably examinees may be able to arrive at the correct answer by eliminating 
literal distractors without exercising complex thinking skills* 

For the Language tests (LHj4), none of the Spelling, Capitalization, or Punctuation items 
are labeled as HOTS* Although some test takers may engage in complex thinking on some of 
these items, there is no way of knowing whether a given item was answered correctly by 
reasoning or by mere recall* Overall the level of thinking demanded by these tests appears to be 
insufficiently complex, in comparison with the conception of HOTS for other tests, to warrant a 
HOTS classification* It is assumed that most of the items can be answered by most students via 
simple recall* 

In the current Forms G and H of the ITBS, higher-order thinking skills are required 
throughout the Usage and Expression (L4) test* Form 7, however, did not contain any 
"effectiveness of expression" items, which represent the majority of the G and H lA HOTS 
items; therefore it was decided for the purposes of this study to label all lA items as non-HOTS 
also. 

For the Work Study tests (W1-W2), items that require the use of multiple thinking steps in 
usmg maps or charts are classified as HOTS* On the other hand, the use of a chart or map to 
describe or retrieve direct information is designated as non-HOTS* 

For the Mathematics tests (M1-M3), items that require multiple comparisons, multiple 
operations, multiple thought stages, or a complex conversion of information are regarded as 
HOTS* Even a multi-step item may be classified as non-HOTS, however, if its solution is based 
on a rote scheme or if it is very similar to questions the student is likely to have encountered 
during instruction* Such items are considered to require more recall than reasoning* 

The item numbers of ITBS Form 7 items classified as HOTS are listed in Table A-1* 

The Iowa Tests of Educational Development (ITED) * For the ITED, all items on the 
Vocabulary test (V) are labeled as non-HOTS* Skill classifications for the Social Studies (SS), 
Natural Science (NS), Literary Materials (L), and Use of Sources of Information (SI) tests are 
provided in the ITED Manual for Teachers* Counselors, and Examiners (Iowa Testing 
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Programs, 1979)* The classification scheme is structured around three levels: knowledge and 
comprehension, application of principles and interpretation, and critical analysis. For tests SS, 
NS, L, and SI, all items in the first level (knowledge and comprehension) are classified as 
non-HOTS. Items in the remaining two levels are combined and classified as HOTS* These 
items involve relatively complex tasks such as inferring unstated relationships, predicting 
outcomes, distinguishing fact from opinion, identifying logical consequences of given 
propositions, and formulating testable hypotheses* 

Items on the ITED tests of Correctness and Appropriateness of Expression (E) and 
Quantitative Thinking (Q) were examined individually. Those involving mere recognition, 
mechanical application of rules, or simple, straightforward computation are classified as 
non-HOTS* Items requiring students to set up multi-step mathematical solutions, draw logical 
conclusions, judge the appropriateness of a statement's content or style, or make use of 
contextually implicit information are designated as HOTS. 

The item numbers of ITED Form X-7 items classified as HOTS are listed in Table A-2. 



18 



TABLE 1 

Summary Data 1978-79 and 1984-85 
ITBS Fonn 7, Grade 3 



Test 


V 


R 


LI 


L2 


L3 


L4 


WI 


W2 


Mi 


M2 


M3 


84-85 M'-an 


22.45 


30.34 


22.36 


20.34 


17.65 


17.46 


24.66 


24.32 


19.58 


14.86 


25.68 


78-79 Mean 


21.39 


28.60 


21.38 


19.01 


16.60 


16.30 


23.57 


23.00 


19.10 


14.22 


24.26 


Mean Difference 


1.06 


^ 1.74 


.98 


1.33 


1.05 


1.16 


1.09 


1.32 


.48 


.64 


1.42 


Mean Biff, in Ounits 


J6 


.19 


.15 


.22 


.18 


.20 


.16 


.17 


.09 


.13 


.18 


84-85 S.D. 


5.70 


8.21 


5.81 


5.49 


5.76 


5.43 


6.09 


7.20 


4.80 


4.90 


7.51 


78-79 S.D. 


6.53 


8.97 


6.57 


6.11 


5.92 


5.81 


6.66 


7.61 


5.34 


5.01 


7.95 


S.D. Ratio 


.87 


.92 


.88 


.90 


.97 


.93 


.91 


.95 


.90 


.98 


.94 


84-85 N 


5359 


5360 


5362 


5362 


5359 


5359 


5336 


5336 


5338 


5333 


5334 


78-79 N 


6637 


6637 


6435 


6430 


6430 


6431 


6428 


6425 


6636 


6633 


6597 



ERIC 
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TABLE 2 

Summary Data 1978-79 and 1984-85 
ITBS Form 7, Grade 7 



X Col 


V 


K 


LI 


L2 


L3 


L4 


Wl 


W2 


Ml 


M2 


M3 


84-85 Mean 


29.52 


34.55 


29.58 


20 87 


21 0*; 




97 10 


^11 9fi 




lo. /I 


no 

z/.yo 


78-79 Mean 


28.35 


33.47 


27.87 


19.89 


20.03 


19.76 


25.74 


30.31 


23.97 


18.03 


26.29 


Mean Difference 


1.17 


1.08 


1.71 


.98 


1 02 


1.05 


1.45 


.95 


.58 


.68 


1.69 


Ms an Diff. in Ounits 


.14 


.10 


.18 


.16 


.16 


.27 


J5 


.i2 


.07 


.12 


.20 


84-85 S.D. 


7.85 


9.84 


8.75 


5.69 


5.93 


5.75 


9.07 


7.94 


7.90 


6.05 


8.68 


78-79 S.D. 


8.65 


10.36 


9.47 


6.05 


6.37 


6.18 


8.88 


8.33 


7.98 


6.04 


8.58 


S.D. Ratio 


SI 


.95 


.92 


.P4 


.P5 


.93 


1.02 


.95 


.PP 


1.00 


2.02 


84-85 N 


6795 


6800 


6791 


6790 


6788 


6788 


6783 


6785 


6793 


6792 


6791 


78-79 N 


7264 


7264 


7264 


7258 


7256 


7253 


7250 


7252 


7253 


7250 


7219 



ERIC 
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TABLES 

Summary Data 1979-80 and 1985-86 • 
ITED Forni X-7, Grade 11 



Test 


E 


Q 


SS 


NS 


L 


V 


SI 


85-86 Mean 


44.11 


19.04 


36.14 


32.76 


27.47 


19.69 


27.95 


79-80 Mean 


41.51 


17.75 


34.17 


30.79 


25.55 


18.19 


25.63 


Mean Difference 


2.60 


1.29 


1.97 


1.97 


1.92 


1.50 


2.32 


Mean Dsff. in c^nits 


.20 


.16 


.16 


.17 


.21 


.17 


.28 


85-86 S.D. 


12.18 


7.77 




11.61 


8.89 


8.86 


8.10 


79-80 S.D. 


13.30 


7.84 


12.65 


11.80 


9.22 


8.93 


8.32 


S.D. Ratio 


.91 


.99 


.98 


.98 


.96 


.99 


.97 


85-86 N 


6321 


6321 


6321 


6321 


6321 


6321 


6321 


79-80 N 


7731 


7731 


7731 


7731 


7731 


7731 


7731 



ERIC 
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TABLE 4 

Changes in Average Performance (aunits) 
and Variability (S.D. Ratio) Across Grades 



Grade 


3 




7 


11 


Average of Mean 
Differences 
in CTunits 


.17 




.14 


.19 


Average of S.D. Ratios 


.92 




.96 


.97 



ERIC 
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TABLES 

Percentage of Students Scoring at Extremes 
1978-79 and 1984-85, Grade 3 



TES 


T 


V 


R 


LI 


L2 


L3 


L4 


Wl 


W2 


Ml 


M2 


M3 


% of students 


1984-85 


3.1 


.2 


3.9 


3 0 


7 


. VI 




•a 
.J 


0 


Z.3 


1.1 


getting 


1978-79 


2.5 


,1 


3.8 


2.3 


.5 


4 


•a 
.J 


•a 




1.0 


.D 


D6rfect <5rorp^ 




. Q 


1 

. I 


1 

. 1 


7 
. / 


./ 


.z 


.2 


fx 

0 


-.3 


.5 


.6 


% of students 


1984-85 


27.3 


10.7 


27.3 


25.5 


11.5 


11.6 


11.9 


12.2 


14.6 


13.4 


12.5 


getting 


1978-79 


25.5 


8.4 


25.8 


20.5 


8.2 


8.6 


10.4 


9.3 


15.3 


10.8 


9.7 


90% or more 


Difference 


1.8 


2.3 


1.5 


5.0 


3.3 


3.0 


1.5 


2.9 


-.7 


2.6 


2.8 


items right 


% change 


7J 


27.4 


5.8 


24.4 




34.9 


14.4 


31.2 




24.1 


28.9 


% of students 


1984-85 


3.6 


3.7 


4.0 


4.0 


8.1 


7.9 


3.2 


6.0 


2.6 


8.6 


5.2 


getting 


1978-79 


7.1 


7.0 


7.2 


7.4 


11.2 


12.8 


5.8 


9.0 


4.3 


11.5 


8.5 


30% or fewer 


Difference 


-3.5 


-3.3 


-3.2 


-3.4 


-3.1 


-4.9 


-2.6 


-3.0 


-1.7 


-2.9 


-3.3 


items right 


% change 


-493 


'47 J 


-44.4 


-45S 


-27.7 


-383 


-44.8 


-333 


-39.S 


-25.2 


-38.8 
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TABLE 6 

Percentage of Students Scoring at Extremes 
1978-79 and 1984-95, Grade 7 



TES 


T 


V 


R 


LI 


L2 


L3 


L4 


1 

Wl 


W2 


Ml 


I 

M2 


M3 


% of students 
getting 
ycrici/i scores 


1984-85 
1978-79 
Differ en ce 


.5 
.9 
-.4 


0 
0 
0 


1.3 

> 

.8 
.5 


.9 
.5 
.3 


1.1 
.9 
.2 


.6 
.4 
.2 


0 
0 
0 


.1 
.1 

0 


.1 

0 

.1 


.6 
.5 
.1 


.7 
.4 
.3 


% of Students 

getting 
90% or more 
items right 


1984-85 
1978-79 
Difference 
% change 


11.5 
11.4 
.1 
.9 


.4.0 
3.7 
.3 
8.1 


15.5 
13.1 
2.4 
18J 


10.8 
9.1 
1.7 

18.7 


14.3 
11.6 
2.7 
23J 


10.4 
8.5 
1.9 

22.4 


1.2 
.6 
.6 

100.0 


8.1 
6.9 
1.2 
17.4 


4.2 
3.6 
.6 
16.7 


8.6 
6.4 
2.2 
34.4 


8.3 
5.4 
2.9 
53./ 


% of students 

getting 
30% or fewer 
items right 


1984-85 
1978-79 
Difference 
% change 


3.5 
6.6 
-3.1 
-47.0 


4.6 
7.1 
-2.5 
■352 


5.9 
9.8 
-3.9 
•39.8 


3.8 
6.2 
-2.4 
-38.7 


4.0 
6.9 
-2.9 
-42.0 


4.6 
7.6 
-3.0 
-39.5 


12.7 
16.0 
-3.3 
-20.6 


2.9 
4.2 
-1.3 
-31.0 , 


9.3 
11.2 
-1.9 
■17.0 


8.9 
10.4 
-1.5 
-14.4 


6.6 
9.3 
-2.7 
-29.0 
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TABLE? 

Percentage of Students Scoring at Extremes 
1979-80 and 1985-86, Grade 11 



TEST 


E 


0 


SS 




T 

Lj 


V 
V 




% of students 


1985-86 


n 
u 


A 


A 
U 


A 

u 


A 
U 


•4 


0 


getting 


1979-80 






A 
U 


A 
U 


A 
U 


.4 


0 


perfect scores 


Difference 


0 


.2 


0 


0 


0 


0 


0 


% of students 


1985-86 


4.6 


6.0 


6.5 


3.2 


3.8 


5.0 


3.8 


getting 


1979-80 


3.5 


5.0 


5.5 


2.5 


2.9 


4.1 


2.2 


90% or more 


Difference 


1.1 


1.0 


1.0 


.7 


.9 


.9 


1.6 


items right 


% change 


31.4 


20.0 


183 


28.0 


5i.O 


22.0 


72.7 


% of students 


1985-86 


4.9 


19.2 


8.8 


11.2 


9.7 


24.3 


5.8 


getting 


1979-80 


9.0 


24.9 


12.4 


16.2 


14.3 


31.3 


10.0 


30% or fewer 


Difference 


-4.1 


-5.7 


-3.6 


-5.0 


-4.6 


•7.0 


-4.2 


items Tifthi 


% change 


-45.6 


-22.9 


-29.0 


-30.9 


-32.2 


-22,4 


-42.0 



ERIC 
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TABLES 

A/cragc Percentage of High-Scoring (90%+ correct) 
and Low-Scoring (30%- correct) Students 
and Percentage Change in Average Percentage 
from Earlier to Later Assessment 



Grade 


3 


7 




11 


High-Scoring 










1984-85 


16.2% 


8.8% 


1985-86 


4.7% 


1978-79 


13.9% 


7.3% 


1979-80 


3.7% 


% chance 


17 Mo 


20J% 


% change 


28.1% 


Low-Scoriing 










1984-85 


5.2% 


6.1% 


1985-86 


12.0% 


1978-79 


8.4% 


8.7% 


1979-80 


16.9% 


% change 


-38.1% 


'29.9% 


% change 


'28.9% 
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TABLE 9 

1978-79 and 1984-85 P-Values by Skill Level 
ITBS Fonn 7, Grade 3 



T 


EST 


V 


R 


LI 


L2 


L3 


L4 


Wl 


W2 


Ml 


M2 


M3 


Total 


Number of items 
1984-85 p-value 
1979-78 p-value 
Difference 


30 
.748 
.713 
.035 


44 
.690 
.650 
.040 


30 
.745 
.713 
.032 


28 
.726 
.679 
.047 


28 
.630 
.593 
.037 


27 
.647 
.604 
.043 


36 
.685 
.655 
.030 


37 
.657 
.622 
.035 


28 
.699 
.682 
.017 


23 
.646 
.618 
.028 


39 
.658 
.622 
.036 


Higher-Level 
Skill 
(HOTS) 


Number of items 
1984-85 p-value 
1978-79 p-value 
Difference 




21 
.626 
.589 
.037 










21 
.641 
.611 
.030 


9 

.680 
.626 
.054 


3 

.677 
.660 
.017 


10 
.525 
.494 
.031 




Lower-Level 

Skill 
(non-HOTS) 


Number of items 
1984-85 p-value 
1978-79 p-value 
Difference 




23 
.748 
.706 
.042 








■ 


15 
.747 
.716 
.031 


28 
.650 
.620 
.030 


25 
.702 
.685 
.017 


13 

.739 
.714 
.025 


■ 1 
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TABLE 10 

1978-79 and 1984-85 P-Values by Skill Level 
ITBS Fonn 7, Grade 7 



T 


EST 


V 


R 


LI 


L2 


L3 


L4 


Wl 


W2 


Ml 


M2 


M3 




Number of items 


43 


57 


43 


31 


31 


31 


52 


47 


42 


30 


45 


Total 


1984-85 p-value 


.687 


.606 


.688 


.673 


.679 


.671 


.523 


.665 


.585 


.624 


.622 




1978-79 p-value 


.659 


.587 


.648 


.642 


.646 


.637 


.495 


.645 


.571 


.601 


.584 


— — >_^_ 


Differ en ce 


.028 


.019 


.040 


.031 


.033 


.034 


.028 


.020 


.014 


.023 


.038 


Highcr-Level 


Number of items 




36 










39 


14 


7 


20 




Skill 


1984-83 p-value 




.588 










.506 


.613 


.510 


.618 




(HOTS) 


1978-79 p-value 




.570 










.481 


.590 


.496 


.599 






Difference 




.018 










.025 


.023 


.014 


.019 




Lower-Level 


Number of items 




21 










13 


33 


35 


10 




Skill 


1984-85 p-value 




.589 










.573 


.687 


.599 


.635 




(non-HOTS) 


1978-79 p-value 




.617 










.538 


.668 


.586 


.605 






Difference 




-.028 










.035 


.019 


.013 


.030 





o 
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TABLE 11 

1979-80 and 1985-86 P-Values by Skill Level 
ITED Fonn X-7, Grade 11 



TEST 


E 


Q . 


SS 


NS 


L 


V 


SI 




Number of items 


69 


36 


60 


60 


46 


40 


46 


Total 


1985-86 p-value 


.639 


.529 


.601 


.545 


.595 


.492 


.608 




1979-80 p-va!ue 


.602 


.493 


.571 


.514 


.557 


.456 


.558 




Difference 


.037 


.036 


.030 


.031 


.038 


.036 


.050 


Higher-Level 


Number of items 


28 


13 


40 


42 


41 




14 


Skill 


1985-86 p-value 


.611 


.512 


.598 


.548 


.593 




.553 


(HOTS) 


1979-80 p-value 


.572 


.481 


.563 


.520 


.553 




.486 




Difference 


.039 


.031 


.035 


.028 


.040 




.067 


Lower-Level 


Number of items 


41 


23 


20 


18 


5 




32 


Skill 


1985-86 p-value 


.659 


.539 


.609 


.538 


.614 




.632 


(non-HOTS) 


1979-80 p-value 


.622 


.500 


.588 


.501 


.582 




.590 




JXfferen ce 


.037 


.039 


.021 


.037 


.032 




.042 
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TABLE 12 
Average Differences 
Between 1984-85/1985-86 and 1978-79/1979-80 p-Values 
for Higher- and Lower-Level Skill Items 



Grade 


3 


7 


11 




Higher-Level Skills 


.034 


.020 


.040 




Lower-Level Skills 










(test areas containing 
higher-level skill items) 


.029 


.014 


.035 




Lower-Level Skills 










(test areas containing no 
higher-level skill items) 


.038 


.034 


.036 
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